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@ Novel amyloid precursor proteins and metliods of using same. 

@ This invention provides novel nucleic acid molecules which encode amyloid precursor muteins and the 
polypeptides encoded therefrom. Also provided are host vector systems useful for the recombinant production of 
the recombinant polypeptides in procaryotic and eucaryotic systems. Cells comprising the host vector systems 
of this invention as well as methods of recombinantly producing these polypeptides are provided by this 
invention. Further provided is a method to detect the recombinant polypeptides of this invention. 
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nnhU^ references are referred to within parentheses. Disclosures of these 

s T^SlZ' "! '^^^''y incorporated by reference into this application to more fSv 

5 descnbe the state of the art to which this invention pertains. Full bibliographic citations for these ^^SLces 
may be found at the end of this application, immediately preceding the claims references 
acteST'the Znf T -^V'oid in plagues and cerebrovascular deposits are char- 

trtZT. J l °^ individuals suffering from Alzheimer's disease (AD) and Down's Syndrome 

(Glenner and Wong. BBRC. 120:885-890, 1984; Glenner & Wong. BBRC 120-113l-ll5ri9fi4rT! 
0 amyloid deposited in these lesions, referred to as beta amyloid peptide (bApTTs a Zfy so re si^f 
aggregating. 39-42 amino acid (aa) protein which is derived via prLolytic cleSage froTa la g^^^^^^ 
(ZZ^.TT^^-^'^'^V'^''' ^ ^^'"^^ 325:733-736. 1987) BAP also is thoughMo brnerotoxc 
ll al EmII^ J 7Z^'7o'«1?' 'r^- ^''^ '''''''''' ^" '"^^9-' t^-3mem'brane prote" Dyr 
5 IciSce 248-492 1^ S '"^ proteolytically cleaved by "secretase" (Sisodia et a!.. 

L !?nS ^-^^^-^^S' ^990; Esch et aL. Science. 248:1122-1124) between BAP-IBK (lysine) and ~iTl 
988 and rs^uf T P-^'^^- amyloidogenesis (Palmert et al.. BbS 156 432-437 

r^^lri T J^^^^^^ °^ amino-terminai APP fragment which is sIcrTted into tii^e culture 
770 w^- — ■ ^ i^)- "^^^^^ -"^J^^ Of APP (APP-695. APP-751 and APP 

51.. Nature 331.530-532. 1988; and Tanzi. et al.. Nature 331:528-530. 1988). are expressed as IntPnr^ 
transmembrane proteins (Kang et al.. Nature 325-733-736, 198?: Dyrks e al.. EMBO J 7^r9 957 ^gsst 

portiorof Tpp'75Wals:r° ^"^ 'J'' '^^'^^ ^ P-"bitor domafn;?is tL secLd 

ITI -JL J.V I ^^'^'^ " et al.. Science. 248:745-748 1990) 

which IS thought to be involved in cell adhesion (Schubert et al.. Ne^ro^ 3:689-694 "1989) rernodelina 
ZtTT^"'' 2t aL, Science, 248:1 126-Tl 28, 1990) and wound repair' ' 

frnr. APP "^schanisms underlying abnormal proteolytic processes which result in BAP extraction 

991 TsLr J T ' "K"'*°°i ' ^° pathogenesis (Selkoe, Neuron ^487 498 

S6?3366 199lTl°' ; ^^'.flf ' ^''^^ "^"^^^ Natl. Acad. Sci.' 

1991) associated with Alzheimer's Disease 

plagLTitBAp'irBAP^'rnf'; ^'"^'f P'^^^^^ ^^^'^op in AD brains, (b) a major component of 
Plagues s BAP, (c) BAP is generated by proteolytic cleavage of APP protein, (d) mRNA levels of soecific 

Chlrh"f T 'T'"' that more APP protein is expressed, (e) APP point mutations 

^D tch" diseasf n ^"^^^^'"^ have been identified inVLi.ial aT(FAD and 

Dutch disease, ( ) injection of BAP into the brains of rodents both form lesions reminiscent of p aoue 

of a?.nLi" ' "^"^"'^ °f amyloid depos tsTn the brains 

oH^nsgenic mice expressing human APP. it is important to understand how APP is processed to genets 

SUMMARY OF THE INVENTION 

nnJr.TrT^"^'°" TT"*^^ ""^'^'^ ^'^'^ molecuies which encode amyloid precursor muteins and the 
polypeptides encoded therefrom. Also provided are host vector systems useful for t^l rcomWnant 
S V.1" . recombinant polypeptides in procaryotic and eucaryotic systems. Cells cor^prS the 
oro iHpH H "T"^" °' "^^^"^"^^ °f recombinantiy producing these poiypepS are 

invemion. ' ' '^^^^^^ — ^--^ PolypeJSe's of th" 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1: Schematic representation of APP-REP 751. APP-REP 751 represents a cleavable app 

"heTpP ^^^"^"'^^^ °' -™a. flan'k ing egions no to scalef 

yJoMh f/'"*." " ^ ^"^'"^ ^^'"^ (corresponding to APP.751 beginnina at 

Xhol through to and including the glycine codon at 15 amino acid residues N-terminal to BApfand the 

rRPKPQQFFG?^^^^ Z'T' ^"'^ -P°^- «P'^°P-- Substratfp SP)?eporte epJto e 

l?rhTr ? ^ .s inserted at the Xhol site. Met-enkephaline (ME) reporter epitope (YGGFM) is inserted 
at the C-term.nus of APP. The resulting construct encodes 492 amino acids (see Figure 2) 

higure 2: Schematic representation depicting the construction of APP-REP from APP-751 cDNA Partial 
representing N- and C-terminal regions of APP-REP were cloned separately as illustrated below The N 
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terminal partial was constructed by ligating sequences encoding substance P (SP) to an N-terminal 
fragment of APP cDNA. The C-terminal partial was constructed by PGR amplification using the correspond- 
ing portion of APP cDNA to introduce novel ends including the Met-enkephalin (ME) reporter epitope. A 
functional APP-REP 751 clone was obtained by subcloning the partials as Indicated. EcoRI (E), Xhol p<), 

5 Hindlll (H), BamHI (B), Sail (S), Xbal (Kb). 

Figure 3: Epitope mapping of APP-REP 751 expressed in COS-1 cells. Immunoprecipitation analysis of 
cell lysate and conditioned medium using the SP (anti-N-terminal substance P reporter) and M3 (anti-C- 
terminal APP) antisera. Lanes 1 and 2, cell lysate immunoprecipitated with SP and M3 antisera, respec- 
tively: lanes 3 and 4, conditioned medium immunoprecipitated with M3 and SP antisera. respectively; lanes 

10 5 and 6. conditioned medium of control cells transfected with vector DNA immunoprecipitated with SP and 
M3 antisera, respectively; lane M, molecular weight markers. 

Figure 4: Pulse-chase analysis of APP-REP 751. Immunoprecipitation of cell lysate (A) and CM (B). 
COS-1 cells were pulsed with p5S]-methionine for 15 minutes and chased using cold methionine for 0, 0.5, 

1, 1.5. 2 and 4 hours (lanes 1 to 6). Lanes 7. 8 and 9 are chase intervals of 0. 1 and 2 hour for control cells 
75 transfected with vector DNA. Lane M, molecular weight markers. 

Figure 5: Epitope mapping and comparative expression of APP-REP 751, BAPEaaoand BAPa^^~^^.A, 
Schematic representation of relevant BAP (boxed) and flanking amino acid sequences of APP-REP 751, 
BAPE22Qand BAPAii-zsjuxtaposed against the putative transmembrane domain (shadowed). B-F, Im- 
munoprecipitation analysis with antibodies recognizing indicated substance P (SP), KPI domain (KPI), C- 
20 terminal APP (M3) or Met-enkephalin (ME) epitopes; Lane M, molecular weight marker. B, Conditioned 
medium obtained from COS-1 cells expressing APP-REP 751 (lane 3), BAPeaaoOanes 4, 6 and 8), 
BAPAii-28(lanes 5, 7 and 9) or control cells with (lane 2) or without (lane 1) transfection with vector DNA. C, 
Cell lysates obtained from COS-1 cells expressing APP-REP BAPE22Q(ianes 1, 4 and 7), BAPa 11-22 (lanes 

2, 5 and 8) and control cells transfected with vector DNA (lanes 3, 6 and 9). D, Accumulation of secreted 
25 APP-REP 751 fragments in the conditioned medium obtained from COS-1 cells expressing APP-REP 751 

(lanes 2 and 6), BAPe22q (lanes 3 and 8), BAPAii-28(lanes 4 and 7). or control cells transfected with vector 
DNA (lanes 1 and 5), were pulsed with pS]-methionineand chased for 45 (lanes 1-4) or 90 (lanes 5-8) 
minutes with cold methionine. E, Accumulation of secreted APP-REP fragments in the conditioned medium 
obtained from stable (Chinese hamster ovary cells; lanes 1-4) and transient (COS-1 cells; lanes 5 and 6) 

30 expression of APP-REP 751 (lanes 2 and 5), BAP^n-as (lanes 3 and 6). BAPe22q (lane 4), or control cells 
transfected with vector DNA (lane 1). 

Figure 6: Peptide mapping and sequencing of fragments secreted into the conditioned medium 
obtained from Chinese hamster ovary cells stably expressing APP-REP 751, BAPe22q and BAPaii-28. A, 
Schematic representation depicting the APP-REP 751 and related derivative indicating the cleavage 

35 products and relevant carboxy-terminal fragments derived from treating the secreted fragments either with 
BNPS-Skatole (B) or cyanogen bromide. Downward- or upward-facing arrows represent BNPS-Skatole and 
cyanogen bromide cleavage sites, respectively. Amino acid lengths of relevant fragments for mapping or 
sequencing are given. B, BNPS-Skatole treatment of fragments secreted into the conditioned medium 
obtained from CHO cells stably expressing APP-REP 751 or BAPaii-28. Mixture of conditioned medium 

40 containing APP-REP and BAPAn-28(lane 1), or BAPAii^28(lane 2) and APP-REP 751 (lane 3) alone. 
Figure 7: Nucleotide and amino acid sequence of the APP-REP 751 protein. 

Figure 8: Nucleotide and amino acid sequence of APP 770 which also is available from the Genebank 
data base under accession number Y00264. 

45 DETAILED DESCRIPTION OF THE INVENTION 

This invention provides a nucleic acid molecule encoding an amyloid precursor mutein, wherein the 
nucleic acid molecule comprises, from the 5' end to the 3' end, a nucleic acid sequence encoding a marker 
and a nucleic acid sequence encoding the amino terminus of APP up to but not including the sequences 

50 that encode BAP. These nucleic acid molecules may include, but are not limited to the nucleic acid 
molecules selected from the group consisting of pCLL983, pCLL935, pCLL934 and pCLL913. 

This invention also provides a nucleic acid molecule encoding an amyloid precursor mutein, wherein 
the nucleic acid molecule comprises, from the 5' end to the 3' end a nucleic acid sequence encoding BAP 
and a nucleic acid sequence encoding a marker. These nucleic acid molecules may include, but are not 

55 limited to the nucleic acid molecules selected from the group consisting of pCLL947, pCLL914, pCLL937. 
pCLL949 and pCLL957. 

Further provided by this invention is a nucleic acid molecule which comprises the nucleic acid 
molecules defined hereinabove to each other. Method of ligating are well known to those of skill in the art. 
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These nucleic acid molecules may Include, but are not limited to the nucleic acid molecules selected from 
the group consisting of pCLL618. pCLL619. pCLL620. pCLL600. pCLL964. pCLL962. pCLL989 pCLL987 
PCLL990, PCLL988. pCLLSOl . pCLL602, pCLL603, pCLL604. pCLL605. pCLL606 and pCLL607. 

As used herein, the term "amyloid precursor mutein" is intended to encompass an amyloid precursor 
5 protein that is mutated, i.e.. it is derived from a nucleic acid molecule which has changes in its primary 
structure as compared to wild-type amyloid precursor protein (APP). Wild-type APP exists in three isoforms, 
thus, the nucleic acid molecule is changed in its primary structure for each of the three isoforms of wild- 
type APP. As is known to those of skill in the art. a mutation may be a substitution, deletion, or insertion of 
at least one nucleotide along the primary structure of the molecule. The mutations which are encompassed 

10 by this invention are the result of saturation mutagenesis in the regions of APP which are susceptible to 
cleavage by endoproteolytic enzymes. These mutations include deletions of nucleic acids encoding 
particular amino acids, substitution of nucleic acid sequences encoding one amino acid for a different amino 
acid and addition of nucleic acid sequences encoding additional amino acids not present in the wild type 
APP sequence. The term "marker" encompasses any substance capable of being detected or allowing the 

15 nucleic acid or polypeptide of this invention to be detected. Examples of markers are detectable proteins, 
such as enzymes or enzyme substrates and epitopes not naturally occurring in wild-type APP that are 
capable of forming a complex with an antibody, e.g. a polyclonal or monoclonal antibody. In the preferred 
embodiment of this invention, the marker is an epitope that is capable of being detected by a commercially 
available antibody. In one embodiment, the marker is an epitope capable of being detected by a 

20 monoclonal antibody directed to the Substance P, the Met-enkephalin or the c- myc epitope. In the most 
preferred embodiment of this invention, the marker is the c- myc epitopic region. 

The term "BAP region" is defined as the region of APP wherein endoproteolytic cleavage will yield the 
ammo-terminus and the carboxy-terminus of the BAP which is deposited as plagues and cerebrovascular 
amyloid in Alzheimer's disease brain. The function of the "BAP region" is to give rise to BAP which may 

26 function as a neurotoxic and/or neurotrophic agent in the brain and as other functionalities ascribed to BAP. 
The "BAP region" may also be endoproteolytically cleaved by enzymes. Such enzymes may include but 
are not limited to the enzymes muUicatalytic prtenase. propyl-endopeptidase. Cathepsin-B, Cathepsin-D, 
Cathepsin-L, Cathepsin-G or secretase. Secretase cleaves between lysine-16 (K-16) and leucine-17 (L-17) 
where full length BAP comprises the amino acid sequence DAEFRHDSGYEVHHQKLVFFAEDVGSNK- 

30 GAIIGLMVGGWIA. Thus, for the purposes of this invention, the preferred embodiment is a cDNA which 
encodes an RNA which is translated into a protein which is the substrate for endoproteolytic activities which 
generate BAP. 

In addition, for the purposes of this invention, the nucleic acid molecule may be DNA, cDNA or RNA 
However, in the most preferred-embodiment of this invention, the nucleic acid is a cDNA molecule. 

35 This invention also encompasses each of the nucleic acid molecules described hereinabove inserted 
into a vector so that the nucleic acid molecule may be expressed, i.e., transcribed (when the molecule is 
DNA) and translated into a polypeptide in both procaryotic and eucaryotic expression systems. Suitable 
expression vectors useful for the practice of this invention include pSVL (Pharmacia), pRCRSV (Invitrogen) 
pBluescript SK+ (Stratagene), pSL301 (Invitrogen), pUC19 (New England Biolabs). However, in the preferred 

40 embodiment of this invention, the vector pcDNA-1-neo is the expression vector for expression in eucaryotic 
cells. As IS well known to those of skill in the art, the nucleic acid molecules of this invention may be 
operatively linked to a promoter of RNA transcription, as well as other regulatory sequences. As used 
herein, the term "operatively linked" means positioned in such a manner that the promoter will direct the 
transcnption of RNA off of the nucleic acid molecule, An example of a promoter is the human 

45 cytomegalovirus promoter. The vectors of this invention preferably are capable of transcribing and/or 
translating nucleic acid in vitro or in vivo. The recombinant polypeptides produced from the expression of 
the nucleic acid molecules of this invention are also provided. 

A host vector system for the production of the recombinant polypeptides described hereinabove and for 
expressing the nucleic acid molecules of the subject invention are provided. The host vector system 

50 compnses one of the vectors described hereinabove in a suitable host. For the purpose of the invention a 
suitable host may include, but is not limited to a eucaryotic cell, e.g., a mammalian cell, a yeast cell or an 
insect cell for baculovirus expression. Suitable mammalian cells may comprise, but are not limited to 
Chinese hamster ovary cells (CHO cells), African green monkey kidney COS-1 cells, and ATCC HTB14 
(Amencan Type Tissue Culture). Most preferably, the cell lines CRL 1650 and CRL 1793 are used Each of 

55 these are available from the American Type Culture Collection (ATCC) 12301 Parklawn Drive. Rockville 
Maryland U.S.A. 20852. Suitable procaryotic cell may include, but are not limited to bacteria cells HBIQl' 
(Invitrogen). MC1061/P3 (Invitrogen). CJ236 (Invitrogen) and JM109 (Invitrogen). Accordingly, the pro- 
caryotic or eucaryotic cell comprising the vector system of this invention is further provided by this 
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invention. 

As is known to those of skill in the art. recombinant DNA technology involves Insertion of specific DNA 
sequences into a DNA vehicle (vector) to form a recombinant DNA molecule which is capable of being 
replicated in a host cell. Generally, but not necessarily, the inserted DNA sequence is foreign to the 

5 recipient DNA vehicle. I.e., the inserted DNA sequence and DNA vector are derived from organisms which 
do not exchange genetic information in nature, or the inserted DNA sequence comprises information which 
may be wholly or partially artificial. Several general methods have been developed which enable construc- 
tion of recombinant DNA molecules. For example. U.S. Patent No. 4,237.224 to Cohen and Boyer describes 
production of such recombinant plasmids using processes of cleavage of DNA with restriction enzymes and 

10 joining the DNA pieces by known method of ligation. 

These recombinant plasmids are then introduced by means of transformation or transfection and 
replicated in unicellular cultures including procaryotic organisms and eucaryotic organisms and eucaryotic 
cells grown in tissue culture. Because of the general applicability of the techniques described therein, U.S. 
Patent No. 4,237.224 is hereby Incorporated by reference into the present specification. Another method for 

75 introducing recombinant DNA molecules into unicellular organisms is described by Collins and Hohn in U.S. 
Patent No. 4,304,863 which is also incorporated herein by reference. This method utilized a packaging, 
transduction system with bacteriophage vectors (cosmids). 

Nucleic acid sequences may also be inserted into viruses, for example, a vaccinia virus or a 
baculovirus. Such recombinant viruses may be generate, for example, by transfection of plasmids into cells 

20 infected with virus, Chakrabarti et al, (1985) Mol. Cell Biol. 5:3402-3409. 

Regardless of the method used for construction, the recombinant DNA molecule is preferable compati- 
ble with the host cell, i.e., capable of being replicated in the host cell either as part of the host 
chromosomes or as an extrachromosomal element. The recombinant DNA molecule or recombinant virus 
preferable has a marker function which allows the selection of the desired recombinant DNA molecule(s) or 

25 virus, e.g., baculovirus. In addition, if all of the proper replication, transcription and translation signals are 
correctly arranged on the recombinant DNA molecule, the foreign gene will be properly expressed in the 
transformed or transfected host cells. 

Different genetic signals and processing events control gene expression at different levels. For instance, 
DNA transcription is one level, and messenger RNA (mRNA) translation is another. Transcription of DNA is 

30 dependent upon the presence of a promoter which is a DNA sequence that directs the binding of RNA 
polymerase and thereby promotes RNA synthesis. The DNA sequences of eucaryotic promoter differ from 
those of procaryotic promoters. Furthermore, eucaryotic promoters and accompanying genetic signals may 
not be recognized in or may not function in a procaryotic system. 

Similarly, translation of mRNA in procaryotes depends upon the presence of the proper procaryotic 

35 signals which differ from those of eucaryotes. Efficient translation of mRNA in procaryotes requires a 
ribosome binding site called the Shine-Dalgarno (SD) sequence on the mRNA. For a review on maximizing 
gene expression, see Roberts and Lauer (1979) Methods in Enzymology 68:473. 

Many other factors complicate the expression of foreign genes in procaryotes even after the proper 
signals are inserted and appropriately positioned. One such factor is the presence of an active proteolytic 

40 system in E. coli and other bacteria. This protein-degrading system appears to destroy foreign proteins 
selectively. A tremendous utility, therefore, would be afforded by the development of a means to protect 
eucaryotic proteins expressed In bacteria from proteolytic degradation. One strategy is to construct hybrid 
genes in which the foreign sequence is ligated in phase (i.e.. In the correct reading frame) with a 
procaryotic structural gene. 

45 Expression of this hybrid gene results in a recombinant protein product (a protein that is a hybrid of 

procaryotic and foreign amino acid sequences). 

Successful expression of a cloned gene requires efficient transcription of DNA, translation of the mRNA 

and in some instances post-translation modification of the protein. Expression vectors have been developed 

to increase protein production from the cloned gene. In expression vectors, the cloned gene is often placed 
50 next to a strong promoter which is controllable so that transcription can be turned on when necessary. Cells 

can be grown to a high density and then the promoter can be induced to increase the number of 

transcripts. These, if efficiency translated, will result in high yields of polypeptide. This is an especially 

valuable system if the foreign protein is deleterious to the host cell. 

Several recombinant DNA expression systems are described below in the Experimental Procedures 
55 section for the purpose of illustration only, and these examples should not be construed to limit the scope 

of the present invention. 

A method for producing a recombinant polypeptide described hereinabove, is also provided. This 
method comprises growing the host cell containing the nucleic acid of this invention and/or the host vector 
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system of this invention under suitable conditions, permitting production of the polypeptide and recovering 
the resulting recombinant polypeptide produced. 

A method of detecting in a sample the presence of any of the recombinant polypeptides described 
hereinabove Is further provided by this invention. In the preferred embodiment of this invention, the marker 
6 is an epitope directed against an antibody, the epitope of which is not present in the wild-type polypeptide 
or APP derivative. This method comprises obtaining a sample suspected of containing the polypeptide and 
contacting the sample with an antibody directed to the marker. The contacting is done under suitable 
conditions to favor the formation of an antibody-epitope (i.e., antigen) complex, and detecting the presence 
of any complex so formed. The presence of complex being a positive indication that the recombinant 
10 polypeptide is in the sample. In one embodiment of this invention, the antibody is a mouse antibody. In 
another embodiment of this invention, the antibody is a human antibody. In the most preferred embodiment, 
the mouse or human antibody is either a mouse or human monoclonal antibody. 

The antibody is labeled with a detectable marker selected from the group consisting of radioisotopes, 
dyes, enzymes and biotin. For the purposes of this invention, suitable radioisotopes include, but are not 
75 limited to, ^^P, 353. 131 | ^nd 1. 

Suitable samples for the practice of this invention include, but are not limited to conditioned mediua, 
cell lysates and cellular organelle fractions. 

The method of this invention may utilize the recombinant polypeptide for the detection of drugs or 
compounds that inhibit or augment the activity of proteolytic enzymes which cleave APP to generate BAP 

20 fragments. For the purposes of example only, a recombinant polypeptide which contains a Substance-P 
marker epitope on the amino-terminal side of BAP and a Met-enkephalin marker epitope on the carboxy- 
terminal side of BAP. Using commercially available RIA kits (Peninsula), one can measure the amount of 
amino-marker and carboxy-marker in any given sample. Since endoproteolytic activity is shown (see Figure 
3) to allow the release of amino-terminal fragments of APP containing the amino-marker into the conditioned 

25 media while carboxy-terminal APP fragments containing the carboxy-marker remain associated with the cell, 
then RIA which measure the amount of amino-marker in the conditioned medium as a direct result of 
endoproteolytic cleavage activity between the marker epitopes preferable within the "BAP region". Using 
this RIA to the amino-marker, the effect of potential drugs designed to modify endoprotease activity can be 
tested comparing the level of amino-marker in untreated and endoprotease-inhibitor treated samples. If a 

30 difference in non-treated and treated samples is found, then the position of the cleavage or lack of cleavage 
can be verified as with the procedures used in Figures 3 to 6. Thus, the qualitative and quantitative aspects 
of endoproteolytic activity and its inhibition on the recombinant APP mutein is evaluated. The amino-marker 
also is an enzyme such as betagalactosidase which would be released int the conditioned media by the 
action of an appropriate endoprotease. Cell free samples of conditioned media containing the liberated 

?5 enzyme converts a chromogenic substrate into the appropriately colored product (Blue for X-gal and Yellow 
for ONPG) which is measured spectrophotometricallY. Inhibitors of the appropriate endoprotease would 
inhibit the release of betagalactosidase enzyme into the conditioned medium resulting in less colored 
product being observed. 

It is a purpose of this invention to develop a cleavable APP substrate system which represents target 
to sequences of BAP including normal flanking regions to provide recognition sequences for processing 
enzymes. The utilization of a common substrate for parallel strategies involving in vitro cleavage assays 
using cellular extracts in vivo processing assays in tissue culture or bacterial cells, or in conjunction with a 
selection system aimed at cloning BAP-cleaving proteases (or other relevant proteins) is preferred. 

A second purpose of this invention is to develop an APP substrate which is non-cleavable by secretase 
5 in order to better detect other putative abnormal processing events which are hypothesized to potentially 
either compete with secretase for limited substrate, or occur at much lower frequency than secretase and 
whose effects may be otherwise masked by the mass action of secretase. 

Third, secretase-cleavable and -noncleavable APP substrates would provide probes with which to 
investigate cellular posttranslational modifications to APP in an attempt to determine the potential influence 
0 on normal secretase and abnormal BAP "clipping" activities. These areas include, among others, the 
consideration of various known APP point mutations, contribution by different cell/tissue types (normal- or 
AD-specific). the Kunitz Protease Inhibitor domain present in APP-770 and -751 isoforms. APP phosphoryla- 
tion and APP glycosylation. 

Fourth, the ability to detect specific APP proteolytic events, either the normal secretase or the abnormal 
5 BAP-generating activities, would enable the use of strategies which use phenotypic rescue as a marker for 
the cloning of potentially relevant and interesting proteases in tissue culture systems. 
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Overview of the APP-REP Strategy 

To study secretase and BAP-generating pathways, portions of APP cDNA clones are used to engineer a 
panel of APP-REPorter (APP-REP) plasmlds to express "nnarked" proteins representing each of the APP 

5 isoforms (and other APP/BAP sequence alterations: see below) in cultured cells. The system utilizes the 
marker Substance-P (SP) and Met-Enkephalin (ME) which are strategically placed, respectively, on amino- 
and carboxy-terminal sides of BAP. Proteolytic cleavage of APP-REP target substrate is determined by the 
electrophoretic sizing of resulting proteolytic fragments and immunological detection of APP-specific and 
SP and ME reporter epitopes. Deletion of a large central portion of APP sequence also makes APP-REP 

10 readily distinguishable from the endogenous APP isoforms based on size. Moreover, the resolution of 
detecting proteolytic cleavage at different positions within the APP-REP substrate protein is enhanced by 
working with shorter target substrates. Approximate location of cleavage is determined initially by fragment 
sizing and epitope mapping; the exact cleavage site Is later determined by peptide mapping of affin- 
ity/HPLC purified fragments and sequencing of peptide ends. 

75 Plasmids also are derived from these constructs for developing similar strategies to express APP-REP 
protein in cell free reticulocyte transcription-translation and bacterial systems. Mutation of APP-REP 
secretase/BAPase cleavage site (by sequence substitution, deletion or FAD mutations) can reveal putative 
proteolytic activities associated with BAP formation including amino- and carboxy-BAPase activities which 
are predicted to result in altered product fragments lengths. 

20 

FIRST SERIES OF EXPERIMENTS 

Bacterial Strains and Transformation 

25 Transformation of commercially available frozen competent bacteria, maintenance and selection of 
transformants is according to the manufacturer. Strains HB101. DH5a or JM109 (Gibco-BRL) are used for 
the construction of APP-REP in pSK( + ) (Stratagene, La Jolla, CA) and pSL 301 (Invitrogen, San Diego, CA). 
APP-REP is subsequently subcloned into the eucaryotic expression vector pcDNA-1-neo and amplified in 
MC1061/P3 (Invitrogen, San Diego, CA). 

30 

Plasmid Construction 

A cassette approach is used to independently construct portions of the APP-REP plasmid (Figure 2). 
The N-terminal partial includes APP sequences through the Substance P (SP) epitope, while the carboxy- 

35 terminal (C-terminal) partial includes BAP (or sequence variations of BAP) through the Metenkephalin (ME) 
epitope (Figure 1). Plasmid encoding the N-terminal cassette (pCLL935) is constructed by ligating the 
EcoRl-Xhol fragment derived from APP-751 cDNA to a short synthetic Xhol-Hindlll fragment encoding 
Substance P (amino acid 1-11). This product is then ligated into the EcoRI and Hindlll sites of pSK( + ). 
Plasmid encoding the carboxy-terminal (C-terminal) cassette (pCLL947) is constructed by cloning into the 

40 Hindlll-BamHI sites of pSL301 a fragment containing BAP sequences which is amplified by polymerase 
chain reaction. The fragment features a novel 5'-Hindlll site beginning at lysine 638 of APP-751, native BAP 
through APP C-terminal sequences, and a C-terminal fusion including the Metenkephalin epitope followed 
by a stop translation codon and a BamHI site. The resulting pSL301 Hindlll-Sail fragment (including the 
Hindlll-BamHI coding region plus BamHl-Sall polylinker sequences) is then isolated and ligated to the N- 

45 terminal cassette by subcloning into the Hindlil-Sail sites of the SK( + )-based, CMV promoter driven, 
eukaryotic expression vector pcDNA-1-neo (pCLL601), whose polylinker is modified to accommodate the 
APP-REP fragment {pCLL602). Polylinker modification involves the substitution of the Hindlll-Xbal fragment 
with a synthetic one which restores Hindlll, destroys Xbal and introduces novel BamHI-Xabl-Xho-Sall sites, 

50 Tissue Culture Lines 

All cells are obtained from American Type Culture Collection and maintained according to their 
recommendation. They include SV40-transformed African Green monkey kidney COS-1 cells (CRL 1650) for 
transient expression and Chinese hamster ovary CH0-1C6 (CRL 1973) for stable expression systems. 

55 
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Transfection Procedure 



Ce ls are seeded at a density of 2-3 X 10^/100 mm dish and transfected using Lipofectin (Gibco-BRL 
Grand island NY) when -75% confluent. Plasmid DNA (0.5-4 mg) Is diluted in 450 ml of Opti-MEM (Gibco- 
BRL. Grand Island NY), mixed with 450 ml containing 75-100 ml Lipofectin and the mixture incubated at 
room temperature for 20-30 minutes. Addition of DNA-Lipofectin mixture to cells, recovery phase and G418 
selection (Gibco-BRL), when applicable, are according to the manufacturer's protocol. Cells and conditioned 
medium are harvested at 48-72 hours following transfection for assay of APP-REP expression. 

Antisera 

■,..,.^7^^^^^'*'r ^"♦;fe''3=anti-N-terminal APP, mouse monoclonal 22C11 (Boehringer-Mannheim Biochem- 
«n inm"^''?!:^^ ^^"^ ^9^'"^* ^ recombinant fusion protein expressing APP-695 (epitope mapped to 
aa 60-100); anti-KPI rabbit polyclonal, raised against recombinant protein encoded by the Hinfl fragment 
denved from APP-770: and anti-APP C-terminal rabbit polyclonal M3. raised against synthetic APP peptides 
corresponding to APP-770 amino acid residues 649-671. Reporter-specific antiseraranti-substance P rabbit 
polyclonal purchased from Peninsula. Belmont. CA; and anti-Met-enkephalin. rabbit polyclonal, purchased 
from Cambridge. Wilmington. DE. 

Preparation of Radiolabeled APP-REP and Extraction from Conditioned Medium and Cell Lysates 

APP-REP proteins transiently expressed in exponentially growing adherent cells (-4 x 10^) are 
radiolabeled by metabolic incorporation of FS]-methionlne as follows. Cell monolayers are washed twice 
with prelabeling medium (methionine-free D-HEM supplemented with glutamine. sodium pyruvate antibiot- 
ics and 1% dialyzed fetal bovine serum (Gibco-BRL) and incubated for 15 minutes to 4 hours in prelabeling 
rnedium containing 150-450 uCiPS]-methionine (Amersham. SOOCi/mmol). If chased with cold methionine 
the medium is removed following the pulse, the monolayer Is washed with prelabeling medium and 
replaced with 3 ml of the same containing 1 mM cold methionine. 

The conditioned medium is recovered following radiolabeling by aspiration from plates and cell debris 
removed by centrifugation for 10 minutes at 4°C (-300xg). Conditioned medium is immediately supplemen- 
pmcTp n of ®t!f '"1!""*°''^ (PePstatin A. 50 ug/ml; leupeptin. 50 ug/ml; aprotinin. 10 ug/ml; EDTA. 5 mM; 
PMSF. 0.25 mM) and immunoprecipitation buffer (IPB; Sisodia et ah, 1990) for protein analysis. Briefly 3 ml 
of CM is supplemented with 0.75 ml 5X IPB (250 mM Tris. pH 6.8; 750 mM NaCI; 25 mH EDTA- 2 5% 
Nonidet P40; 2.5% sodium deoxycholate) and incubated for 20 minutes at 4°C prior to use 

Lysates are prepared by washing the labeled cell monolayer twice with 5 ml pre-labeling medium and 
directly extracting cells in plates at 4° C with 3.75 ml IX IPB (including protease inhibitors). Cells are 
scraped into the buffer, incubated for 20 minutes at 4»C and lysates clarified of cellular debris bv 
centrifugation for 20 minutes at 10,000xg. 

For radloiodination of cell surface proteins, monolayers are chilled on ice, washed 3 times with 5 ml ice 
cold PBS and labeled at room temperature for 10 minutes following the addition of: 5 ml PBS containing 0 2 

Snm!f'rn"'fM'^?"°^^'^' ^^"^ ^"^'^"'^ ""^ lactoperoxidase (1 mg/ml distilled water, 

hydrogen peroxide solution (freshly prepared by diluting 10 ml of 30% stock in 10 ml of 
PBb added at 0, 3. 6, and 9 minutes of iodination. At 10 minutes, the supernatant is removed and cells 
gently washed with 10 ml of ice cold PBS (containing 10 mM Nal). Four ml of PBS is added and CM and 
cell lysates are prepared as above. 

Immunoprecipitation Analysis 

Aliquots of radiolabeled lysate or conditioned medium representing 4-8x105 cells are thawed on ice 
supplemented with protease inhibitors (see above), boiled for 3 minutes in 0.35% SDS and chilled on ice' 
Samples are preincubated on a shaker for 1.5 hours at 4°C with 2-10 ul 2X of preimmune (or normal rabbit) 
serum and 2 mg Protein A-Sepharose (Sigma; prepared in IX IPB), and insoluble immune removed by 
contnfugation. APP-or reporter epitope-specific antisera (0.1-10 ul) and 2 mg Protein A-Sepharose were 
sim^^larly added and incubated overnight. Specific immune complexes were precipitated, washed 4 times 
M ; ooTL protease Inhibitors), extracted with 20 ul Laemmli sample buffer (Laemmli (1970) 

Nature ^:680-685). boiled for 3 minutes and fractionated by electrophoresis on SDS-polyacrylamide-tris- 
glycine (Bio-Rad Laboratories, Richmond. VA) or SDS-polyacrylamide-tris-tricine Daiichi (Integrated Separa- 
tion Systems, Natick, MA) gels. Gels are then treated with Enlightening Autoradiographic Enhancer ^New 
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England Nuclear, NEF-974) and dried in vacuo with heat and exposed to Kodak X-AR film at -70°C. 
Western (Imnnunoblot) Analysis 

6 Lysate or 10X concentrated conditioned nnedium (Centricon 30 microconcentrator; Amicon, Beverly, 

MA) representing 4-8x10^ cells are supplemented with an equal volume of 2X Laemmli sample buffer, 
boiled for 2 minutes, fractionated by electrophoresis on SDS-polyacrylamide-tris-glycine (Bio-Rad, XX) or 
SDS-polyacrylamide-tris-tricine Daiichi (Integrated Separation Systems, Natick, MA) gels and transblotted 
(Semi-Phor, Hoefer Instruments, San Francisco. CA) to Immobilon-P membrane (Millipore, Bedford. MA), 

10 Membranes are pre-biocked in 10 ml 5% non-fat dry mllk/PBST (PBS with 0.02% Tween-20) for 45 minutes 
at room temperature prior to overnight incubation at 4°C with primary antisera (in fresh pre-blocking 
solution). Blots are then washed, incubated with secondary antibody, washed and developed for horseradish 
peroxidase activity as described (ECL Luminol Kit; Amersham. Arlington Heights, IL). 

75 Peptide Mapping and Determination of the Site of Proteolytic Cleavage by Peptide Sequencing 

The secretase clip site is determined essentially as described (Wang et aL, (1991) J. Biol. Chem. 
266:16960-16964). Approximately 1X10^ CHO cells stably expressing APP-REP are seeded in each 150 
mm dish containing DMEM (complete with 200 ug/ml G418) and incubated for 36 hours. Cells are washed, 

20 preincubated for 6 hours in serum-free medium [MCDB 302 supplemented with antibiotics, L-giutamine (292 
mg/l) and proline 12 mg/l (Sigma) to remove serum components, washed, and incubated for another 72 
hours in fresh serum-free media. 

Serum-free conditioned medium was pooled and cell debris Is removed by centrifugation (10 minutes at 
300xg. then 30 minutes at 100,000xg) and concentrated by acetone precipitation and fractionated by FPLC. 

25 Conditioned medium concentrate is loaded on an anion exchange column (Mono Q: source) and protein Is 
eluted in 20 mM Tris (pH 7.4) over a 0-1 M NaCI gradient. Fractions containing secreted APP are identified 
by immunoblotting (monoclonal antibody 22C11) and relevant samples pooled, desalted (NP-5 column; 
Pharmacia. Piscataway. NJ) and concentrated. Proteins are then denatured, treated with cyanogen bromide 
(in 10% trifluoroacetic acid) and peptides separated by high performance liquid chromatography (Vydac 

30 Ci8 reverse-phase) attached to a FAB-MS unit. Relevant peaks derived from APP-REP 751 and APP-REP 
BAP ii-28are identified by locating those peaks uncommon to both proteins. The C-terminal peptides 
derived from APP-REP BAP 11-28 (predicted 14 amino acid) and APP-REP 751 (predicted 17 amino acid) 
are sequenced (MilliGen solid phase peptide sequencer; Millipore, Burlington, MA). 

36 EXPERIMENTAL RESULTS 

Characterization of APP-REP Expression by Epitope Mapping 

The APP-REP strategy (Figure 1) is system for the expression of marked APP proteins in tissue culture 

40 cells in order to characterize the proteolytic cleavage events. The deletion of 276 amino acid portion 
distinguishes the construct of this invention from endogenously expressed APP on the basis of size, and is 
predicted to increase the resolution of APP-REP fragments resulting from the proteolytic cleavage by 
secretase or other amyloidogenic, BAP-generating cleavage events. Substance P and Met-enkephalin 
marker epitopes strategically placed on either side of BAP enable the immunological detection of N- and C- 

45 terminal fragments, respectively, which result from proteolytic cleavage of APP-REP substrate. 

APP-REP protein transiently expressed in COS-1 cells has been radiolabeled by metabolic incor- 
poration of PS]-methioninein a 60 minute pulse, immunoprecipitated with antisera, and size fractionated by 
gel electrophoresis as demonstrated in Figure 3. Immunoprecipitation with a panel of APP- and APP-REP- 
specific antisera which recognize epitopes mapping at various positions along APP-REP. reveals the 

50 presence of 2 proteins of -63 kDa in cell lysates (including cytoplasmic and membrane associated proteins) 
as shown in Figure 3. The specific detection by antisera directed against the KPI domain, the carboxy- 
terminus of APP (M3, Figure 3A) and Met-enkephalin, as well as by the N-terminal 22C11 monoclonal in 
Western blot analysis (data not shown), suggests that both bands represent the full-length APP-REP protein. 
Although the 492 amino acid APP-REP is predicted to display a mobility of -49-54 KdA, the larger 63 and 

55 76 kDa proteins are expected based on previous observations attributing the aberrant migration properties 
of APP, putatively to post-translational modification like tyrosine-sulfation, glycosylation and phosphorylation 
(Dyrks et aL, (1988) EHSO J. 7:949-957; Weidemann et aL, (1989) Cell 57:1 15-126. 
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Analysis of the conditioned nnedium (CM) collected from those same cells above indicates that an N 
termmal fragment of APP-REP is released into the CM. Figure 3B reveals a shorter -77 kDa fS^^^^ 
.mmunoprec.p.table from CM with KPI and SP antisera (and the 22C11 monoclonal by Western InTsS 

S»rAPP ^ ^ . : ^:6338-6342; Palmert et ah. (1989 a) P.N.A.S. U.S.A. 85:7341-7345). b) indicatino 

CM and a'sho'rt m °' '^'"'^'^^ - the secretion ofTn N-termina fragment s 

CM, and a short membrane associated C-terminal fragment. 

DStiveTof ApJ^^^^^^ Precursor/Product Relationship Between Cell Associated and Secreted 

To show that APP-REP undergoes post-translational modification accounting for the 2 cell associated 
protems. and that the N-terminal APP-REP fragment released into CM is dLed from oneTtLte 
precursors, radiolabeled APP-REP is with a short 15 minute pulse and collected both eel lyTates and CM at 
vanous chase mtervals as shown in Figure 4. Immunoprecipitation analysis reveals S APP REP inTalfv 
m,grates at -63 kDa and is rapidly "chased" up to -76 kDa with conversL rate of le sihan 1 0 1 5 n^nS 
tS a'pp RE^ °irAP?"" ;-^^-"«tative analysis), an observation which is consistent wi^l. the o t o 
tnat APP-REP, like APP, is substrate for posttranslational modifications 

folloI^.H h^^th""' ^^"'^ ^""^^ '"P'^'y disappears (t - -20 minutes) (Figure 4A and 5C) 

followed by the appearance of a shorter -67 kDa band in the CM (Figure 48 and 5C) The released -67 

n'lcS'^rRETd?! f "^^^ ' ' hours). ThTtemporS X 

Z nttZ 1. K °f 3 ^^oner -67 kDa protein in CM. and the recognition of 

APP REP wS i.' ^."^r'V'ir^ P^°t«°'ytic Cleavage o 

APP-REP which is similar to the normal, non-amyloidogenic. "secretase" activity which results in the 
release of an N-terminal APP fragment (Sisodia et al., (1990) Science 248:492-495. 

Cleavagf °' ^^"^ ^"'^"^"^^^ P^^^^"* Proteolytic 

In an attempt to engineer non-cleavable substrates for secretase. APP-REP proteins are expressed 
LaHi -28^ (BA? o^M Boir "Ceavage/recognition site" putativeiy encompassed ^^a resS " 
fBAP nm fin-^r'Vh ^' °' '■®P''^^«"tmg the BAP point mutation found in patients with HCHWA-D 
Prminff'f ? ■ J '^o^struct representing the BAPE22Q mutation results in secretion of an N- 
terminal fragment indistinguishable from the APP-REP protein (Figure 5C). Deletion of extracellular 
uxtamembranous 18 aa (BAP, 11-28). however, still results in the secretion of an N-"erminarAi^P REP 
fragment into the CM (Figure 58). A slightly faster migration of fragment derived from the deT t construe' 
PCLL604 in companson to that of wild-type pCLL602, is consistent with the 18 aa delet^n and a 
corresponding loss of -2 kDa (Figure 5C). Pulse-chase analyses (Figure 5D) indicate that exp^ss^n o^full- 
nTiT"'^' ^'^^^^'^'^ ^"^ the release of fragment iX CM is bo h 

?CHofll ' to that Of the wild-type APP-REP sequence. Chinese hamste ova J 

CHO) ceNs stably expressing APP-REP display results similar to that of transiently expressing COS-1 eel s 
(Figure 5E ) Collectively, these data suggest that the cleavage in each case may be the resu^ of simS 
biochemical events despite the difference in juxtamembranous sequences (Figure 5A) 

Full-Length APP-REP Proteins Are Associated with Plasma Membrane Prior to Cleavage 

In preliminary experiments, detection of the amino-terminal APP-REP fragment in CM and not in cell 
lysa^es, suggests that the putative secretase activity might be plasma membrane-associared. One prld^^o^ 
of this notion is that an N-terminal portion of APP-REP might be (partially) localized to the extraceS^ter 

ZnZT *° '''' '^yP^*^^^'^' cells stably ex^res^oTpP-S^?^^ 

(PCLL602) are subjected to lactoperoxidase-catalyzed iodination to radiolabel only extrace Sar proteins 

aTe^T 0 mTn^fnc'lr'";- ''''''' ^^'^ ^"^'^^^'^ immediately^ot:inttdinS^o 

after a 10 minute incubation. Presence of the -76 kDa APP-REP band in cell lysate should indicate that at 
least a portion of full-length APP-REP is poised extracellularly in association wi?h cell mernb ane ietecl^ 
-67 kD^^^^^^^^^^ int^f ^! i'"'' ' corresponding increased fracl of 

APP-REP sTaved ^"""^ " '"^^^'^ extracellular portion of 
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Peptide Sequencing to Determine the site of Proteolysis 

Fragment secreted into serum-free media derived from CHO cells stably expressing APP-REP with 
wild-type or BAP 11-28 sequences has been analyzed to determine the actual site of proteolytic cleavage 

5 as shown in Figure 6. Peptide mapping by tryptophan-specific cleavage with BNPS-skatole is used to 
roughly determine the approximate position of cleavage in each molecule. Western blot analysis using SP 
antisera following BNPS-skatole treatment (Figure 6B) reveals fragments whose lengths of --10.5 and -9.5 
kDa, corresponding to wild type and BAP n -28 respectively, confirming that cleavage occurs in the C- 
terminal portion of the PN-2-like protein as expected (Figure 6A). To determine the actual position of 

10 cleavage, secreted fragment is partially purified, treated with cyanogen bromide and relevant C-terminal 
peptides derived from APP-REP wild type, 

DISCUSSION 

75 The examined the expression of a truncated form of APP-751, namely APP-REP 751 (pCLL602) is 
examined and its normal cleavage by secretase. Comparison of the nontransfected cells and those 
transfected with APP-REP 751, in both COS-1 transient and CHO stable expression systems, show the 
production of shorter secreted protein derived from APP-REP. Furthermore, upon a prolonged exposure of 
the fluorogram only one band is observed in condition medium. Epitope mapping with antibodies to N- and 

20 C-terminal domains of APP-REP and amino acid sequencing suggest post-translational cleavage at a site 
similar to that reported for intact APP protein and other truncated APP constructs similar to that reported for 
intact APP protein and other truncated APP constructs. Pulse-chase experiments reveal post-translationai 
modifications, believed to be similar to those described for the intact APP protein, in which a single -63 
kDa product is chased up to -76 kDa in the first 30 minutes. Appearance of the -76 kDa cell membrane 

25 associated protein precedes the release of a -67 kDa product into the CM. The released form, which is not 
observed in the cell lysate fraction, steadily accumulates in the conditioned medium well after the -76 kDa 
band has begun to disappear suggesting a precursor-product relationship. These data indicate that the 
APP-REP protein is a good representation of the naturally occurring APP with respect to post-translational 
synthesis, processing, and stability in a tissue culture system. 

30 Epitope mapping of APP-REP 751 mutants suggest that BAP E22a,as well as the BAPAii-28deletion 
constructs, are initially expressed as larger proteins of predicted lengths which subsequently are cleaved to 
release N-terminal fragments into the CM. The pulse-chase experiments Indicate the cell-associated and 
secreted forms accumulate with similar kinetics. 
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TABLE 1 

Construction of app-hvv P artiiaTg 

A. PSK(+) Amino-Terminal Constructs- 

Eo??in! ?J ""f"* Reporter 

Plasn.id App xsofonn Reporter Epitope 

HindllT FraoTT^ftr^t- ) 
PCLL983 APP-695 
PCLL935 APP-751 
PCLL934 APP-770** 



Substance P* 

Substance p 

Substance p 
PCLL913 APP-77n« 

Substance p 



Notes : 



"O^SSSniJi?" <»»"^ived from th. shorter APP- 

?5l"?S^rio^!"' derived tro. th, longer APP- 



It III S^^^^^^-Terminal Constructs: Cloning 
?HinH?;?''»'''^i!?^ Reporter Epitope FusioS 



Plasmid 

PCLL947 
PCLL914 
PCLL937 



Met-Enkepha 1 in ( ME ) 
Fusion at end nf-. 
Full-Length APP 

Transmembrane Domain 
BAP 



Name of Var^a^_1^n 
APP-BAP-APP-ME 
APP-BAP-TM-ME 
APP-BAP-ME 
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TABLE 1 

Cons'truction of APP-REP Partials 

(Continued) 

pSL301' Carboxy-Terminal Full-Length APP-ME 
Constructs: Introduction of Mutations in BAP 
(Hindlll-BamHI/Sall Fracment^ 



Plasmid Met-Enkephalin 

Name Fusion at End of: 

pCLL94 9 E to Q substitution at 

BAP aa#22 
PCLL957 G to A substitution at 

BTaa#10, deletion of BAP 

AA#ll-28 and creation of 

Ndel site 



Name of Variation 
BAP22EQ 

BAP-vaall-28 
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TABLE 5 

APP-REP Constructs Modeling APP Mutations 
Associated with Diseases Involving BAP Deposition 

APP "717" MUTATIONS 

// APP Transmembrane Domain / 



LL LlAPl 







711 


712 


713 


714 


715 


716 


717 


718 


719 






[40 


41 


42] 














PCLL602 


APP* 


GTC 


ATA 


GCG 


ACA 


GTG 


ATC 


GTC 


ATC 


ACC 






V 


I 


A 


T 


V 


I 


V 


I 


T 


PCLL611 


717VI** 


GTC 


ATA 


GCG 


ACA 


GTG 


ATC 


&TC 


ATC 


ACC 






V 


I 


A 


T 


V 


I 


I 


I 


T 


pCLL612 


717VGe 


GTC 


ATA 


GCG 


ACA 


GTG 


ATC 


GGC 


ATC 


ACC 






V 


I 


A 


T 


V 


I 


G 


I 


T 


PCLL613 


717VT$ 


GTC 


ATA 


GCG 


ACA 


GTG 


ATC 


ITC 


ATC 


ACC- 






V 


I 


A 


T 


V 


I 


F 


I 





TABIiE 5 (continued) 



DUTCH DISEASE 



PCLL602 BAP* 



pCLL603 BAP- 
22EQ# 







: V (secretase clip) 








686 


687 


: 688 


689 


690 


691 


692 


693 


694 


[15 


16 


: 17 


18 


19 


20 


21 


22 


23] 


CAA 


AAA 


: TTG 


GTG 


TTC 


TTT 


GCG 


GAA 


GAT 


Q 


K 


: L 


V 


F 


F 


A 


E 


D 


CAA 


AAA 


: TTG 


GTG 


TTC 


TTT 


GCA 


CAA 


GAT 



pCLL606# 



Notes: 

# APP-REP-751 and -770 derived BAP-22EQ constructs. 

** Goate ^ al. (1991) Nature, 219:704-706; Yoshioka ^ al^ 
(1991) BBRC 178 : 1141-1146; Naruse efc (1991) Lancet 332:978- 

979. 

@ Chartier-Harlin et ^ (1991) Nature 353:844-846. 
$ Murrell e£ (1991) Science ISl: 97-99. 
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15 



20 



25 



30 



35 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
Gin Lys Leu Val Phe Phe Ala Gin Asp 

1 c: 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

( i ) APPLICANT: . ^ 

(A) NAME: American Cyanarmd Company 

(B) STREET: 1937 West Main Street 

(C) CITY: Stamford 

(D) STATE: Connecticut 

(E) COUNTRY: U.S. A 

(F) POSTAL CODE (ZIP): 06904-0060 

(ii) TITLE OF INVENTION: Novel Amyloid Precursor Proteins and Methods 

of Using Same 



(iii) NUMBER OF SEQUENCES: 27 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release *1.0, Version #1.26 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 93105718 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 196.. 1671 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
AAGCTTGGGG ATCCGCTCTA GAACTAGTGG ATCCCCCGGG CTGCAGGAAT TCGGGGGGGG 
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25 



45 



50 



CAGCGGTAGG COAGAGOACO CGGAGGAGCG TGCGCGGGGC CCCGGGAGAC GGCGGCGGrG 
3CGGCGCGGG CAGAGCAAGG ACGCGGCGGA TOGCACTCGO ACAGCAGCGC ACTCGGTGCC 
CCGGGCAGGG .GGG A. C.G CCO GG. .G GOA 0. C. C.G C.G GGC GGO 

'^n f^i s - m ^; - - - 

20 25 

lIu p^^s i]i tr. iit ;ii aac atg cac 

30 lie Ala Met Phe Cys Gly Arg Leu Asn Met His 

"et SI? til St t« ™= 2" -A GAT CCA TCA GGG ACC AAA 

45 5^ ^"^P Ser Asp Pro Ser Gly Thr Lys 

60 

C- - - A. C.G CAG r.r TGC CAA G^A G.C 

?S J^J ?I? 5:-° - J- CAA CCA G.G 

90 

™r tV. G=?„' a's'S t'?= Jy'J a'^? G^fJ r "° «0 CAT 

95 ^ ^ Lys Gin Cys Lys Thr His 

105 

P?o Sts TTA GTT GGT GAG TTT GTA AGT 

^^s Phe Val lie Pro Tyr Arg Cys Leu Val Gly Glu Phe ?al Ser 

no .,20 

III I- SI - AAG TGC AAA TTC TTA CAC CAG GAG AGG 

140 

".I iz 1% sti ill - - - -c .g 

Ss til tru tti ^ Ts'. Hi i'"" 

160 "rfc '^^'^ Met Leu Leu 

J- JII - - - - - GTA GAG TTT GTG TGT TGC CCA 

185 



75 



120 
180 
231 

279 

327 

376 

423 

471 

519 

567 



615 



711 



759 



55 



20 
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10 



75 



20 



CTG GCT GAA GAA AGT GAC AAT GTG GAT TCT GCT GAT GCG GAG GAG GAT 807 
Leu Ala Glu G1u Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp 
190 195 200 

GAC TCG GAT GTC TGG TGG GGC GGA GCA GAC ACA GAC TAT GCA GAT GGG 855 
Asp Ser Asp Val Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly 
205 210 215 220 

AGT GAA GAG AAA GTA GTA GAA GTA GCA GAG GAG GAA GAA GTG GCT GAG 903 
Ser Glu Glu Lys Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu 
225 230 235 

GTG GAA GAA GAA GAA GCC GAT GAT GAC GAG GAC GAT GAG GAT GGT GAT 951 
Val Glu Glu Glu Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp 
240 245 250 

GAG GTA GAG GAA GAG GCT GAG GAA CCC TAC GAA GAA GCC AGA GAG AGA 999 
Glu Val Glu Glu Glu Ala Glu Glu Pro Tyr Glu Glu Ala Arg Glu Arg 
255 260 265 

ACC ACC AGC ATT GCC ACC ACC ACC ACC ACC ACC ACA GAG TCT GTG GAA 1047 
Thr Thr Ser He Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu 
270 275 280 

GAG GTG GTT CGA GAG GTG TGC TCT GAA CAA GCC GAG ACG GGG CCG TGC 1095 
Glu Val Val Arg Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys 
285 290 295 300 

CGA GCA ATG ATC TCC CGC TGG TAC TTT GAT GTG ACT GAA GGG AAG TGT 1143 
Arg Ala Met He Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys 
305 310 315 

GCC CCA TTC TTT TAC GGC GGA TGT GGC GGC AAC CGG AAC AAC TTT GAC 1191 
Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp 
320 325 330 

AGA GAA GAG TAC TGC ATG GCC GTG TGT GGG AGC GCC ATT CCT ACA ACA 1239 
Arg Glu Glu Tyr Cys Met Ala Val Cys Gly Ser Ala He Pro Thr Thr 
335 340 345 

GCA GCC AGT ACC CCT GAT GCC GTT GAC AAG TAT CTC GAG CGG CCC AAG 1287 
Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Arg Pro Lys 
350 355 360 

CCC CAG CAG TTC TTT GGC CTG ATG GGA AGC TTG ACA AAT ATC AAG ACG 1335 
Pro Gin Gin Phe Phe Gly Leu Met Gly Ser Leu Thr Asn He Lys Thr 
365 370 375 380 

GAG GAG ATC TCT GAA GTG AAG ATG GAT GCA GAA TTC CGA CAT GAC TCA 1383 
Glu Glu He Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser 
385 390 395 



25 



30 



35 



45 



50 



55 
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15 



25 



30 



P'ro 'rtr tti II' HI 5AG AAC TAT GGG GGC TTC ATG 



35 



80 



1479 



IT. SIT „- S?I - III ;ii - ^; .r. 

k; jjs tit ii^ ji: S - - - Z ..c 

"^20 425 

ni ??a ?I? JI^ 51? JIS f ? t'^. ^"^^ ^'^^ ^^^^ ^AT 

430 tl® ^""^ Met Leu Lys Lys Lys His 

435 

r^'r iTr ill sti ^ z ?i? SI? 

455 460 

If. J- s?^ J- ir„ - - - - 



1623 



1721 



Pro Thr ryr .y. P„, p.e 57= IT. .el Ify If, ^ 

20 485 490 

TAGGATCCAT ATATAGGGCC CGGGTTATAA TTACCTCAGG TCGACCTAGA 

(2) INFORMATION FOR SEQ ID N0:2: 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 492 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ser Arg 

^ 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 

25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 

40 45 

- Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 

55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 

^0 75 



45 



50 



55 
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Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 

85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 

165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Glu Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 

245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Arg Glu Arg Thr Thr Ser He 
260* 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Arg Glu Glu Tyr 

325 330 335 

Cys Met Ala Val Cys Gly Ser Ala He Pro Thr Thr Ala Ala Ser Thr 
340 345 350 

Pro Asp Ala Val Asp Lys Tyr Leu Glu Arg Pro Lys Pro Gin Gin Phe 
355 360 365 
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Phe Gly Leu Met Gly Ser Leu Thr Asn He Lys Thr Glu Glu He Ser 

Glu val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 

^ 395 400 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 

■ 410 415 

Gly Ala lie lie Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 

10 425 43Q 

ne val lie Thr Leu Val Met Leu Lys Lys Lys His Tyr Thr Ser lie 

440 445 

His His Gly val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 

75 ^OO 4gQ 

His Leu ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 

^75 4^0 
Phe Phe Glu Gin Met Gin Asn Tyr Gly Gly Phe Met 
20 490 
(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 3353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



25 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGTTTCCTCG GCAGCGGTAG GCGAGAGCAC GCGGAGGAGC GTGCGCGGGG CCCCGGGAGA 
CGGCGGCGGT GGCGGCGCGG GCAGAGCAAG GACGCGGCGG ATCCCACTCG CACAGCAGCG 
CACTCGGTGC CCCGCGCAGG GTCGCGATGC TGCCCGGTTT GGCACTGCTC CTGCTGGCCG 
CCTGGACGGC TCGGGCGCTG GAGGTACCCA CTGATGGTAA TGCTGGCCTG CTGGCTGAAC 



60 
120 
180 
240 



50 



55 
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6 



10 



15 



20 



25 



30 



35 



40 



CCCAGATTGC 


CATGTTCTGT 


GGCAGACTGA 


ACATGCACAT 


GAATGTCCAG 


AATGGGAAGT 


300 


GGGATTCAGA 


TCCATCAGGG 


ACCAAAACCT 


GCATTGATAC 


CAAGGAAGGC 


ATCCTGCAGT 


360 


ATTGCCAAGA 


AGTCTACCCT 


GAACTGCAGA 


TCACCAATGT 


GGTAGAAGCC 


AACCAACCAG 


420 


TGACCATCCA 


GAACTGGTGC 


AAGCGGGGCC 


GCAAGCAGTG 


CAAGACCCAT 


CCCCACTTTG 


480 


TGATTCCCTA 


CCGCTGCTTA 


GTTGGTGAGT 


TTGTAAGTGA 


TGCCCTTCTC 


GTTCCTGACA 


540 


AGTGCAAATT 


CTTACACCAG 


GAGAGGATGG 


ATGTTTGCGA 


AACTCATCTT 


CACTGGCACA 


600 


CCGTCGCCAA 


AGAGACATGC 


AGTGAGAAGA 


GTACCAACTT 


GCATGACTAC 


GGCATGTTGC 


660 


TGCCCTGCGG 


AATTGACAAG 


TTCCGAGGGG 


TAGAGTTTGT 


GTGTTGCCCA 


CTGGCTGAAG 


720 


AAAGTGACAA 


TGTGGATTCT 


GCTGATGCGG 


AGGAGGATGA 


CTCGGATGTC 


TGGTGGGGCG 


780 


GAGCAGACAC 


AGACTATGCA 


GATGGGAGTG 


AAGACAAAGT 


AGTAGAAGTA 


GCAGAGGAGG 


840 


AAGAAGTGGC 


TGAGGTGGAA 


GAAGAAGAAG 


CCGATGATGA 


CGAGGACGAT 


GAGGATGGTG 


900 


ATGAGGTAGA 


GGAAGAGGCT 


GAGGAACCCT 


ACGAAGAAGC 


CACAGAGAGA 


ACCACCAGCA 


960 


TTGCCACCAC 


CACCACCACC 


ACCACAGAGT 


CTGTGGAAGA 


GGTGGTTCGA 


GTTCCTACAA 


1020 


CAGCAGCCAG 


TACCCCTGAT 


GCCGTTGACA 


AGTATCTCGA 


GACACCTGGG 


GATGAGAATG 


1080 


AACATGCCCA 


TTTCCAGAAA 


GCCAAAGAGA 


GGCTTGAGGC 


CAAGCACCGA 


GAGAGAATGT 


1 140 


CCCAGGTCAT 


GAGAGAATGG 


GAAGAGGCAG 


AACGTCAAGC 


AAAGAACTTG 


CCTAAAGCTG 


1200 


ATAAGAAGGC 


AGTTATCCAG 


CATTTCCAGG 


AGAAAGTGGA 


ATCTTTGGAA 


CAGGAAGCAG 


1260 


CCAACGAGAG 


ACAGCAGCTG 


GTGGAGACAC 


ACATGGCCAG 


AGTGGAAGCC 


ATGCTCAATG 


1320 


ACCGCCGCCG 


CCTGGCCCTG 


GAGAACTACA 


TCACCGCTCT 


GCAGGCTGTT 


CCTCCTCGGC 


1380 


CTCGTGACGT 


GTTCAATATG 


CTAAAGAAGT 


ATGTCCGCGC 


AGAACAGAAG 


GACAGACAGC 


1440 


ACACCCTAAA 


GCATTTCGAG 


CATGTGCGCA 


TGGTGGATCC 


CAAGAAAGCC 


GCTCAGATCC 


1600 


GGTCCCAGGT 


TATGACACAC 


CTCCGTGTGA 


TTTATGAGCG 


CATGAATCAG 


TCTCTCTCCC 


1660 


TGCTCTACAA 


CGTGCCTGCA 


GTGGCCGAGG 


AGATTCAGGA 


TGAAGTTGAT 


GAGCTGCTTC 


1620 


AGAAAGAGCA 


AAACTATTCA 


GATGACGTCT 


TGGCCAACAT 


GATTAGTGAA 


CCAAGGATCA 


1680 


GTTACGGAAA 


CGATGCTCTC 


ATGCCATCTT 


TQACCGAAAC 


GAAAACCACC 


GTGGAGCTCC 


1 740 


TTCCCGTGAA 


TGGAGAGTTC 


AGCCTGGACG 


ATCTCCAGCC 


GTGGCATTCT 


TTTGGGGCTG 


1800 



45 



50 
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ACTCTGTGCC 


AGCCAACACA 


GAAAACGAAG 


TTGAGCCTGT 


TGATRPPPGP 


1 clO 1 CaCCG 


1860 


ACCGAGGACT 


GACCACTCGA 


CCAGGTTCTG 


GGTTGACAAA 


TATPAAGAPG 


oAlalaAuATCT 


1920 


CTGAAGTGAA 


GATGGATGCA 


GAATTCCGAC 


ATGACTCAGfi 




UA I C/ATCAAA 


1980 


AATTGGTGTT 


CTTTGCAGAA 


GATGTGGGTT 


CAAACAAAGG 


TGPAATPATT 


^auAL* I CA I GG 


2040 


TGGGCGGTGT 


TGTCATAGCG 


ACAGTGATCG 


TCATCACnTT 




AAOAAOA A A^ 

AAGAAGAAAC 


21 00 


AGTACACATC 


CATTCATCAT 


GGTGTGGTGG 


AGGTTGACGP 


PGPT(^TPAPr^ 


UUAGAGGAGC 


2160 


GCCACCTGTC 


CAAGATGCAG 


CAGAACGGCT 


ACGAAAATCP 


AAPPTAPA AO 


TTCTTTGAGC 


2220 


AGATGCAGAA 


CTAGACCCCC 


GCCACAGCAG 


CCTCTGAAGT 


TG^^ APA/2PA A 


AACCATTGCT 


2280 


TCACTACCCA 


TCGGTGTCCA 


TTTATAGAAT 


AATGTGGGA A 


GAAAPAAA/^O 
UAAAUAAAv^L* 


CGTTTTATGA 


2340 


TTTACTCATT 


ATCGCCTTTT 


GACAGCTGTG 


CTGTAACACA 


AGTAGATGCC 


TO A A OTTr^ A A 

1 uAAC TTGAA 


2400 


TTAATCCACA 


CATCAGTAAT 


GTATTCTATC 


TCTCTTTACA 


TTTTGGTCTC 


*T~AXAOTAr^A-r 

\ AT ACTACAT 


2460 


TATTAATGGG 


TTTTGTGTAC 


TGTAAAGAAT 


TTAGCTGTAT 


CAAACTAGTG 


CATGAATAGA 


2520 


TTCTCTCCTG 


ATTATTTATC 


ACATAGCCCC TTAGCCAGTT GTATATTATT 


CTTGTGGTTT 


2580 


GTGACCCAAT 


TAAGTCCTAC 


TTTACATATG 


CTTTAAGAAT 


CGATGGGGGA 


TGCTTCATGT 


2640 


GAACGTGGGA 


GTTCAGCTGC 


TTCTCTTGCC 


TAAGTATTCC 


TTTCCTGATC 


ACTATGCATT 


2700 


TTAAAGTTAA 


ACATTTTTAA 


GTATTTCAGA 


TGCTTTAGAG 


AGATTTTTTT 


TO O A TO A ^ 

I CCATQACTG 


2760 


CATTTTACTG 


TACAGATTGC 


TGCTTCTGCT 


ATATTTGTGA 


TATAGGAATT 


AAGAGGATAC 


2820 


ACACGTTTGT 


TTCTTCGTGC 


CTGTTTTATG 


TGCACACATT 


AGGCATTGAG 


ACTTCAAGCT 


2880 


TTTCTTTTTT 


TGTCCACGTA 


TCTTTGGGTC 


TTTGATAAAG 


AAAAGAATCC 


CTGTTCATTG 


2940 


TAAGCACTTT 


TACGGGGCGG 


GTGGGGAGGG 


GTGCTCTGCT 


GGTCTTCAAT 


"T* A A A ^ A A 

TACCAAGAAT 


3000 


TCTCCAAAAC 


AATTTTCTGC 


AGGATGATTG 


TACAGAATCA 


TTGCTTATGA 


CATGATCGCT 


3060 


TTCTACACTG 


TATTAGATAA 


ATAAATTAAA 


TAAAATAACC 


CCGGGCAAGA 


CTTTTCTTTG 


31 20 


AAGGATGACT 


ACAGACATTA 


AATAATCGAA 


GTAATTTTGG 


GTGGGGAGAA 


GAGGCAGATT 


3180 


CAATTTTCTT 


TAACCAGTCT 


GAAGTTTCAT 


TTATGATACA 


AAAGAAGATG 


AAAATGGAAG 


3240 


TGGCAATATA 


AGGGGATGAG 


GAAGGCATGC 


CTGGACAAAC 


CCTTCTTTTA 


AGATGTGTCT 


3300 


TCAATTTGTA 


TAAAATGGTG 


TTTTCATGTA 


AATAAATACA 


TTCTTGGAGG 


AGO 


3353 



45 



50 
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(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
70 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



75 



20 



30 



35 



40 



(vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys 
15 10 15 

Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala lie He 
20 25 30 



25 Gly Leu Met Val Gly Gly Val Val lie Ala 

35 40 

(2) INFORMATION FOR SEQ ID NO : 5 : 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Arg Pro Lys Pro Gin Gin Phe Phe Gly Leu Met 
1 5 10 



50 



55 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 21 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

CAT CAA AAA TTG GTG TTC TTT 
His Gin Lys Leu Val Phe Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

His Gin Lys Leu Val Phe Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(ix) FEATURE: 
^ (A) NAME/KEY: CDS 

(B) LOCATION: 1 . .21 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

^0 CAT CAA GAG TTG GTG TTC TTT 21 
His Gin GVu Leu Val Phe Phe 
1 5 



75 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9; 



25 



35 



50 



His Gin Glu Leu Val Phe Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO : 1 0 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
{ B ) TYPE : nucl e i c ac i d 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



( ix) FEATURE: 

(A) NAME/KEY: CDS 
40 (B) LOCATION: 1 . .21 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



45 



55 



29 



EP 0 584 452 A1 



CAT CAA GTG TTG GTG TTC TTT 
His GTn Val Leu Val Phe Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

His Gin Val Leu Val Phe Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..21 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12 

CAT CAA GTG TTG GTG TTC TTT 
His Gin Val Leu Val Phe Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

5 His Gin Val Leu Val Phe Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
?o (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



20 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .21 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CAT CAA AAA TTG GTG CCG TTT 21 
His Gin Lys Leu Val Pro Phe 
1 5 

3b (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Gin Lys Leu Val Pro Phe 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1,.27 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1 

GTC ATA GCG ACA GTG ATC GTC ATC ACC 
Val He Ala Thr Val lie Val He Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

Val He Ala Thr Val He Val lie Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . , 27 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTC ATA GCG ACA GTG ATC ATC ATC ACC 
Val lie Ala Thr Val lie lie He Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1 

Val lie Ala Thr Val lie He lie Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..27 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20 

GTC ATA GCG ACA GTG ATC GGC ATC ACC 
Val lie Ala Thr Val He Gly He Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

Val lie Ala Thr Val He Gly He Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GTC ATA GCG ACA GTG ATC TTC ATC ACC 
Val He Ala Thr Val He Phe He Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23; 

Val He Ala Thr Val He Phe He Thr 
1 5 



34 



EP 0 584 452 A1 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 - .27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

CAA AAA TTG GTG TTC TTT GCG GAA GAT 27 
Gin Lys Leu Val Phe Phe Ala Glu Asp 
1 5 * 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



30 



(xi ) 



SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Gin Lys Leu Val Phe Phe Ala Glu Asp 
1 5 



36 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 



40 



(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(ii ) 



MOLECULE TYPE: DNA (genomic) 



(iii) 



HYPOTHETICAL: NO 
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( i v) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . .27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CAA AAA TTG GTG TTC TTT GCA CAA GAT 
Gin Lys Leu Val Phe Phe Ala Gin Asp 
1 6 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Gin Lys Leu Val Phe Phe Ala Gin Asp 
1 6 



Claims 

1. A nucleic acid molecule encoding an amyloid precursor mutein. wherein the nucleic acid molecule 
compnses. from the 5' end to the 3' end a nucleic acid sequence encoding a marker and a nucTJ^Tacid 
sequence encoding amino acid up to but excluding nucleic acids encoding BAP domain 

2. A nucleic acid molecule encoding an amyloid precursor mutein. wherein the nucleic acid molecule 

- — bap domam an^d ^^si: 

a"ecrot"clim 2. ""^'^'^ °' ^'^'^ ' "^^^^^ "-'eic 

H^LI^^T """'^"'^ °' ^'^"^ ^' "'^'^'^'^ ^'^'^ i« « nucleic acid molecule 

selected from the group consisting of DNA, cDNA or RNA. moiecuie 

5. The nucleic acid molecule of claim 1. wherein the nucleic acid molecule is selected from the orouo 
consisting of pCLL983. pCLL935. pCLL934 and pCLL913. ^ 

^' 111"^'^"'/?'?.^°'^''"'^ °' '^'^"^ ^' "'^^'^'^ ^^'d "molecule is selected from the qrouo 

consisting of pCLL947, pCLL91 4. pCLL937. pCLL949 and pCLL957. ^ ^ 

^" acid molecule of claim 3, wherein the nucleic acid molecule is selected from the qrouo 

consisting of pCLL619. pCLL620. pCLLei8. pCLL964. pCLL962 pCLL989 001198^ nCn Qon 
PCLL988. PCLL600. pCLL601. pCLL602. pCLLeOs' pCLL604 'pCLL605, pCLL606 and pS^^^^ 

8. A vector comprising the nucleic acid molecule of claim 1 . claim 2 or claim 3. 

9. A cell comprising the nucleic acid molecule of claim 1 , claim 2 or claim 3. 

tl'n m^'''"!"* polypeptide produced by the nucleic acid molecule of claim 1 , produced by the nucleic 
acid molecule of claim 2 or produced by the nucleic acid molecule, of claim 3. 
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A method of detecting the presence of the recombinant polypeptides of claim 10 in a sample, which 
comprises contacting an antibody directed to the marker and the sample under suitable conditions to 
favor the formation of an antibody-antigen complex, and detecting the presence of any complex so 
formed. 
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Figure 1 . 
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Figure 2. 
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Figure 3. 
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■igure 4. 
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Figure 5. 
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Figure 5. 
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Figure 5. 
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Figure 6. 
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S£QU£NCS : 
VECTOR: 



INSERT ; 
SEQUENCE: 



pCLL602 (APP-REP 751 pi:ot«in) 
pcDNA-I-neo (tnvitrogen) 

pcDNA-I-neo-XS <JSJ modified polyiink^c to permit 

aubcioning into Xtoal-Sall sites) 
Xbal-Sall fragment encoding APP-RE? from pCLL964 
5' poiylinker: 

Hindlll-Xbal from pcDNA-I-neo-XS 
Xbal-EcoRI from pBluescript SK+ 
APP-REP 751: 

Amino-terTninal partial from pCLL935) : 
5* untranslated APP cDNA (from EccRI) 
N-terminal APP (to Xhol) 
Substance P marker (Xhol to Hindlll) 
Carboxy-terminal partial from pCLL947) ; 

C-tenainal APP and BAP (from novel Hindlll) 
Met-enkephalin marker (plU3 5top codon) 
3' polylink^r: 

BamHI-Sall from pSL30l 

Sall-«nd of sequence from pcDNA-I-neo-XS 



dirdctiona 

2-47 

2-15. 
16-47 

48-131^. 

48-195 
196-1273 
1274.1314 
1314-1671 
1314-1656 
1657-1674 

1674-1711 
1712-1721 



10 20 30 40 50 

* • * • 

AAGCTTGGGG ATCCGCTCTA GAACTAQTC3C ATCCCCCGGG CTGCAGGAAT 
TTCGAACCCC TA<3GCGAGAT CTTGATCACC TAGGG<3<3CCC GACGTCCTTA 

60 70 80 90 100 

« * * • * 

rCQGGGGQGQ CAGCGGTAGG CGAGAGCACG CCSOAOOAOCQ TGCGCGQGGG 
AGCCCCCCCC GTCGCCATCC GCTCTCOTGC GCCTCCTCGC ACGCGCCCCO 

110 120 130 140 150 

* • « * * 

cccgggagac ggcggcggtg gcggcgcggg cagagcaagg acgcggcgoa 
gggccctctg ccgccgccac cgccqcoccc gtctcgttcc TGCGCCGCCT 

160 170 180 190 

* * * * 

tcccactcgc acagcagcgc ACTCGGTGCC ccgcgcaggg tcqco 

AGGGTGAGCG TGTCGTCGCG TGAGCCACGG GGCGCGTCCC AGCGC 

200 210 220 230 240 

* * * * * 

ATG CTG CCC GGT TTG GCA CTG CTC CTO CTG GCC QCC TGG ACG OCT 
TAC OAC GOG CCA AAC CGT GAC GAG GAC GAC CGO CGG ACC TGC CGA 
Met Leu Pro Gly teu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala> 

250 260 270 280 

* * ♦ * 

CGG GCO CTG GAG GTA CCC ACT GAT GGT AAT GCT GGC CTG CTG GCT GAA 
GCC CGC GAC CTC CAT GGG TGA CTA CCA TTA CGA CCG GAC GAC COA CTT 
Arg Ala Leu Olu Val Pro Thr Aap Gly Asn Ala Gly Leu Leu Ala Glu> 



FIGURE 7 
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FIGURE 7 
(continued) 
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FIGURE 7 
(continued) 



49 



EP 0 584 452 A1 



"'2 

SI IS S IS S ?S ^= 

^h, Tyr Gly JyJ Sy Ln S I" IIS ^ 

«iy Aan Arj Aan Aan Phe Aap Thr Glu 51u> 

1220 1230 1240 

I« IS ?I? g§ ?S I?I S ?S JII 

Cy, v., JJI ?S ?S I?I S S> 

'"2 

?s cgI ?ji si s§ jj? i?i SS ^« 

Thr Pro Aap Ala 2? "J JyJ 2u III ?S ^'^'^ 

~»P j.y> Tyr L«u Glu Ar^ Pro Lya Pro Gin Cln> 



1300 1310 



* 



1320 1330 



, 1340 



* 



1370 1380 1330 

IS S5 ?I? ^ Jl? -I gj cjx 

S*. Glu V.1 .y. Met A.P SI X -A III 

^"2 1430 

cll SI SI JJJ H? S? IS III ^ 

v-i HI, ... JJS ^ SI §s 2; §s -J - 

"50 1460 1470 1480 

?JJ SI JIS JII gj SS i2 £2 ^" 

^.3 aiy Al. - - - CCG CAA CAJ ,A, CGC JOX^ 

'''I 1510 1520 1530 

S? J2 SS ^ 11? S? JI? S2 ^ ^ ^ ^^^'^ 
!!• V.1 ii: 5J? Sit f^P '^^^ '^oo 

inr i^u V*l Mat Uu Lya Lya Lya Gin Tyr Thr Ser> 
"'2 1560 1570 1580 

ju SI SI s s? s? £ <=-^ 

Xl« Hia Hi, 31y V^? A^l S Si P^I G^IJ G^:^> 
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90 1600 1610 1620 1630 

* * * « * 

CGC CAC CTG TCC AAG AT<3 CAQ CAG AAC GOC TAG GAA AAT CCA ACC TAG 
GCG GTG GAC AGO TXC TAC GTC GTC TTO CCG ATG CTT TTA GGT TGG ATG 
Arg Hia L«u S«c Lya M«t Gin Gin Aan Gly Tyr Glu Aan Pro Thr Tyxr> 

1640 1650 1660 1670 1680 

* * * • * 

AAG TTC TTT GAG GAG ATG CAG AAC TAT GGG GGC TTC ATG TAG GAtCCA 

TTC AAG AAA GTC GTC TAC GTC TTG ATA CCC CCG AAG TAC ATC CTAGGT 

Lys Phe Phe Glu Gin Met Gin Aan Tyr Gly Gly Ph» M«t *** 

1690 1700 1710 1720 

« * * * 

TATATAGGGC CCGGGTTAT AATTACCTCA GGTCGACCTA GA 
ATATATCCCG GGCCCAATA TTAATGGAGT CCAGCTGGAT CT 
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Total of 12 
?.s.-lR03>TYPE 
LOCUS 

Human amylo 
ACCESSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
STANDARD 

REFERENCE 
AUTHORS 
JOURNAL 
STANDARD 

COMMENT 



PRI 



FEATURES 
pept 

BASE COUNT 
ORIGIN 



files . 
APP770. 

HUMAFPA4 3353 bp ss-ruRNA 

id A4 mRNA, complete cds . 
Y00264 

amyloid fibril protein; cell surface glycoprotein, 
human (Homo sapiens) . 
Homo sapiens 

Eukaryota; Animalia; Metazoa; Chordata; Vertebrata 
Therxa; Eutheria; Primates; Haplorhini; Catarrhini 

1 (bases 1 to 3353; enum. -146 to 3207, no zero) 
Kang,J., Lemaire, H . G . , UnterbecJc, A . , Salbaum,J.M, 
Th!^^^ Multhaup,G., Beyreuther,K. and Mueller -Hill , 3 
J2!l^^n^ff^°'' Alzheimer's disease amyloid A4 protein rese^±,les 
cell-surface receptor 

Nature 325, 733-736 (1987) 
simple automatic 

2 (bases 1 to 3353; enum. 1 to 3353) 
Mueller Hill,B. 

Unpublished (1987) Submitted to the EMBL data library 
simple automatic 

♦source: tissue-cortex of brain; ♦source: developmental 
stage-5-month-old aborted fetus; 



15-JUN-1989 



Mammalia ; 
Hominidae . 

Masters, C . 



EMBL 


features not 


key 


from 


SITE 


3080 


SITE 


3089 


SITE 


3331 


POLYA 


3353 


from 


to/span 


147 


2234 



description 
polyA signal 
polyA signal 
polyA signal 
polyA site 



922 a 



745 c 



to 
3085 
3094 
3336 
3353 

description' 
amyloid A4 /nomgen-^'APP" /map-"21q21 . 2" 
/hgml_locus_uid-"LG0l36J'* 
867 g 819 t 



App770, Length: 3353 May 29, 1990 11:30 Check: 6510 



1 


AGTTTCCTCG 


GCAGCGGTAG 


GCGAGAGCAC 


GCGGAGGAGC 


GTGCGCGGGG 


51 


CCCCGGGAGA 


CGGCGGCGGT 


GGCGGCGCGG 


GCAGAGCAAG 


GACGCGGCGG 


101 


ATCCCACTCG 


CACAGCAGCG 


CACTCGGTGC 


CCCGCGCAGG 


GTCGCGATGC 


151 


TGCCCGGTTT 


GGCACTGCTC 


CTGCTGGCCG 


CCTGGACGGC 


TCGGGCGCTG 


201 


GAGGTACCCA 


CTGATGGTAA 


TGCTGGCCTG 


CTGGCTGAAC 


CCCAGATTGC 


251 


CATGTTCTGT 


GGCAGACTGA 


ACATGCACAT 


GAATGTCCAG 


AATGGGAAGT 


301 


GGGATTCAGA 


TCCATCAGGG 


ACCAAAACCT 


GCATTGATAC 


CAAGGAAGGC 


351 


ATCCTGCAGT 


ATTGCCAAGA 


AGTCTACCCT 


GAACTGCAGA 


TCACCAATGT 


401 


GGTAGAAGCC 


AACCAACCAG 


TGACCATCCA 


GAACTGGTGC 


AAGCGGGGCC 


451 


GCAAGCAGTG 


CAAGACCCAT 


CCCCACTTTG 


TGATTCCCTA 


CCGCTGCTTA 


501 


GTTGGTGAGT 


TTGTAAGTGA 


TGCCCTTCTC 


GTTCCTGACA 


AGTGCAAATT 


551 


CTTACACCAG 


GAGAGG AT GG 


ATGTTTGCGA 


AACTCATCTT 


CACTGGCACA 


601 


CCGTCGCCAA 


AG AG AC AT GC 


AGTGAGAAGA 


GTACCAACTT 


GCATGACTAC 


651 


GGCATGTTGC 


TGCCCTGCGG 


AATTGACAAG 


TTCCGAGGGG 


TAGAGTTTGT 


701 


GTGTTGCCCA 


CTGGCTGAAG 


AAAGTGACAA 


TGTGGATTCT 


GCTGATGCGG 
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751 


AGGAGGATGA 


CTCGGAioTC 


TGGTGGGGCG 


GAGCAGACAC 


AG> xATGCA 


801 


GATGGGAGTG 


AAGACAAAGT 


AGTAGAAGTA 


GCAGAGGAGG 


AAGAAGTGGC 


O ^ X 


T A fin T(^(^ A A 


f^AAf^AAflAAf; 


rCGATGATGA 


CGAGGAPGAT 

wwnvjvjnv^vjA X 


GAGGATGGTG 

V>J AOw A X Os3 X O 






pp a a p a pppt 


GaGi^a APPPT 


appaapaapp 

A\* o A^(j AA V* 


papapapapa 
^ AC ACs AO A(j A 


7 O 1 




TT/^PP APPAP 


PAPPAPP APP 


APPapar^aGT 


pTriT/^/^a af^a 

X o X OoAno A 


1001 




f^TTPPTAPAA 


PAGCAGCPAG 


TAPPPPTGAT 
xnwWWw X on X 


GPPGTTGAPA 

O W^ O X XOAVA 


10 51 


AGTATCTCGA 


GArAGCTGGG 


GATGAGAATG 


AAPATGPPPA 
nnwn xowwwn 


TTTPPAGAA A 
XXX wv^nonAA 


1101 


GCCAAAGAGA 


GGCTTGAGGC 


CAAGCACCGA 


GAGAGAATGT 


CCPAGGTPAT 
w wwnv30 X v^n x 


1151 


GAGAGAATGG 


GAAGAGGCAG 


AACGTCAAGC 


AAAGAACTTG 


CCTAAAGPTG 
wv^ X fv^nww X w 


1 901 

X ^ VJ X 




AHTTATPPAf^ 


PATTTPPAGG 

wn X X X WW<n\J^7 


AGAAAGTGGA 
nonnno x o wn 


ATPTTTGGAA 
nxw X X xoonn 


X ^ ^ X 


PAf^f^AAf^PAf^ 


PPAAP(^A(^AG 


APAGCAGCTG 


GTGGAGACAC 
w X oonon^nw 


ACATGGPPAG 
nwn X oviwwnvj 


X ^ U X 


Af^TfZr^AAf^PP 


ATf^PTPAATG 
/\ 1 X w An X o 


ACCGCCGCCG 

nv^W wW w w w w w 


CCTGGPCCTG 

WW X www WW X w 


GAGAAPTAPA 


X O O X 


TPAPPf^PTPT 


«»w X O X i. 


PPTPCTPGGC 

WW X WW X Vw«vjV7w 


CTCGTCACGT 
w X w o X w n w o X 


GTTCAATATG 
w X X ^vnn x n x w 


1401 

X *1 W X 


PTAAACiAAf^T 


ATGTCCGCGC 


AGAACAGAAG 


GACAGACAGC 


ACACCCTAAA 


1 4 S 1 

X 1 ^ X 




CATGTGCGCA 

wr\ X w X w WW w«^ 


TGGTGGATCC 

ak WVJ A WW** A, WW 


CAAGAAAGCC 


GCTCAGATCC 

>7W A W«aW'>a X W w 


1 

X «J X 


VJV3 X ^^^rvv^v^ X 


TATGAPAPAr* 


PTPPGTGTGA 
Var X w w w X w X 


TTTATGAGCG 

XXX t\ X O <^ w w ^7 


PATGAATPAG 


X ^ O X 


TPTPTPTPPP 


TGPTPT APA A 


PGTGPPTGPA 


GTGGPPGAGG 
v-7 X oo w w onovj 


AGATTPAGGA 
non X X v,»nown 


X O U X 


THA A^^TT^AT 

1 X X wA X 


Vj/\VJV^ X X xw 


AGAAAGAGPA 

nO«VrV\ wA w w n 


AAAPTATTCA 


GATGACGTCT 

on X wn^.i'O X w X 


X 0<J X 


(^PP A AP AT 


GATT Af^THAA 


PPAAGGATPA 


GTTAPGGAAA 
wX xnwwonn^i 


PGATGPTPTP 
Vwton X o \^ X w X w 


1701 


n X ov^wAX X X 




GAAAACCACC 


GTGGAGCTCC 


TTCCCGTGAA 


1751 


T AG AGTTC 


AGCrTGGACG 

*VO w\,,>i X wwnww 


ATCTCCAGCC 


GT'GGCATTCT 


TTTGGGGCTG 


X O VJ X 


X ^ X X VjV^ w 


AGPPAACACA 


GAAAACGAAG 


TTGAGCCTGT 


TGATGCCCGC 

A W«« * w W W W w 


1851 


w w X w\»» X w w 


APCGAGGACT 

«» w Vw» w f\\3 wn w X 


GACCACTCGA 

\J4»w w<aw A Ww*» 


CCAGGTTCTG 


GGTTGACAAA 


X ? VJ X 


TATPAA(^Arf5 


GAGGA^^ATCT 


PTGAAGTGAA 

w X w AX\w X w«^n 


GATGGATGCA 

w*^ X w w«« X w^^*» 


GAATTCCGAC 

A www<n>^ 


X 7 ^ X 


A T A PTP A^i/^ 


ATAT/^AAi^TT 

A X A X wAAu X X 


P ATP ATP AAA 
w A X ViK A X Vi^nnA 


AATTGGTGTT 
nn X X WW X o X X 


PTTTGCAGAA 
w X X X Ownon^^ 




\J£\ X V} X X X 


PAAAPAAAGG 

V^AAAwA/U%V9w 


TGPAATPATT 
X \9wAn X wn X X 


GGACTCATGG 
uvjnw X wn x ww 


TGGGCGGTGT 

X WW WW WW X w X 


2051 


TGTCATAGCG 


ACAGTGATCG 


TCATCACCTT 


GGTGATGCTG 


AAGAAGAAAC 


2101 


AGTACACATC 


CATTCATCAT 


GGTGTGGTGG 


AGGTTGACGC 


CGCTGTCACC 


2151 


CCAGAGGAGC 


GCCACCTGTC 


CAAGATGCAG 


CAGAACGGCT 


ACGAAAATCC 


2201 


AACCTACAAG 


TTCTTTGAGC 


AGATGCAGAA 


CT 












AGACCCCC 


GCCACAGCAG 


2251 


CCTCTGAAGT 


TGGACAGCAA 


AACCATTGCT 


TCACTACCCA 


TCGGTGTCCA 
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2^01 


TTTATAGAAT 


AATGTG-^^AA 


2351 


ATCGCCTTTT 


GACAGCTGTG 


2401 


TTAATCCACA 


CATCAGTAAT 


2451 


TATACTACAT 


TATTAATGGG 

^fXX X X www 


2501 


CAAACTAGTG 




2551 


TTAGCCAGTT 


GTATATTATT 


2601 


TTTACATATG 


CTTTAAGAAT 


2651 


GTTCAGCTGC 


TTCTCTTGCC 


2701 


TTAAAGTTAA 


ACATTTTTAA 


2751 


TCCATGACTG 


CATTTTACTG 


2801 


TATAGGAATT 


AAGAGGATAC 


2851 


TGCACACATT 


AGGCATTGAG 


2901 


TCTTTGGGTC 


TTTGATAAAG 


2951 


TACGGGGCGG 

* \J\J v^\j >J 


GTGGGGAGGG 


3001 


TCTCCAAAAC 


AATTTTCTGC 


3051 


CATGATCGCT 


TTCTACACTG 


3101 


CCGGGCAAGA 


CTTTTCTTTG 


3151 


GTAATTTTGG 


GTGGGGAGAA 


3201 


GAAGTTTCAT 


TTATGATACA 


3251 


AGGGGATGAG 


GAAGGCATGC 


3301 


TCAATTTGTA 


TAAAATGGTG 


3351 


AGC 





?RMRD3> 



GAAACAAACC CGTTTTATGA TTT^CTCATT 
CTGTAACACA AGTAGATGCC TGAr^CTTGAA 
GTATTCTATC TCTCTTTACA TTTTGGTCTC 
TTTTGTGTAC TGTAAAGAAT TTAGCTGTAT 
TTCTCTCCTG ATTATTTATC ACATAGCCCC 
CTTGTGGTTT GTGACCCAAT TAAGTCCTAC 
CGATGGGGGA TGCTTCATGT GAACGTGGGA 
TAAGTATTCC TTTCCTGATC ACTATGCATT 
GTATTTCAGA TGCTTTAGAG AGATTTTTTT 
TACAGATTGC TGCTTCTGCT ATATTTGTGA 
ACACGTTTGT TTCTTCGTGC CTGTTTTATG 
ACTTCAAGCT TTTCTTTTTT TGTCCACGTA 
AAAAGAATCC CTGTTCATTG TAAGCACTTT 
GTGCTCTGCT GGTCTTCAAT TACCAAGAAT 
AGGATGATTG TACAGAATCA TTGCTTATGA 
TATTACATAA ATAAATTAAA TAAAATAACC 
AAGGATGACT ACAGACATTA AATAATCGAA 
GAGGCAGATT CAATTTTCTT TAACCAGTCT 
AAAGAAGATG AAAATGGAAG TGGCAATATA 
CTGGACAAAC CCTTCTTTTA AGATGTGTCT 
TTTTCATGTA AATAAATACA TTCTTGGAGG 
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